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FN THE SPECIFICATION 

At page 6, please replace paragraph(s) [00 1 9] with the following text: 

Accordingly, in one aspect, the invention features a nucleic acid molecule which encodes an 
80090, 52874, 52880, 63497, or 33425 protein or polypeptide, e.g., a biologically active portion of the 
80090, 52874, 52880, 63497, or 33425 protein. In a preferred embodiment, the isolated nucleic acid 
molecule encodes a polypeptide having the amino acid sequence of SEQ ID NO:2, 5, 8, 1 1, or 14. In 
other embodiments, the invention provides an isolated 80090, 52874, 52880, 63497, or 33425 nucleic 
acid molecule having the nucleotide sequence shown in SEQ ID NO: 1,3 ,4,6,7,9, 10, 12, 13 or 1 5 , or th e 

sequence of the DNA insert of the plasmid d e posited with ATCC Acc e ssion Number . In still other 

embodiments, the invention provides nucleic acid molecules that are substantially identical (e.g., naturally 
occurring allelic variants) to the nucleotide sequence shown in SEQ ID NO: 1 ,3,4,6,7,9, 1 0, 1 2, 1 3 or 1 S^ef 

the sequence of the DMA insert of the plasmid deposited with ATCC Acc e ssion Numb e r . In other 

embodiments, the invention provides a nucleic acid molecule which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 1,3,4,6,7,9,10,12,13 or 1 5 , or th e s e quence of the DNA insert of th e plasmid d e posited with ATCC 
Accession Number wherein the nucleic acid encodes a full length 80090, 52874, 52880, 63497, or 
33425 protein or an active fragment thereof. 

At page 7, please replace paragraph(s) [0024] with the following text: 

In other embodiments, the invention provides 80090, 52874, 52880, 63497, or 33425 
polypeptides, e.g., an 80090, 52874, 52880, 63497, or 33425 polypeptide having the amino acid sequence 
shown in SEQ IDNO:2,5,8,l 1 or 14; the amino aoid sequ e nce encoded by the oDNA insert of th e 
plnrunid d e posited with ATCC Accession Numb e r ; - an amino acid sequence that is substantially 
identical to the amino acid sequence shown in SEQ ID NO:2,5,8,l 1 or 14; or an amino acid sequence 
encoded by a nucleic acid molecule having a nucleotide sequence which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 1,3,4,6,7,9,10,12,13 or 1 5 , or th e s e quenc e of the DNA insert of the plasmid d e posit e d with ATCC 
Accession Numb e r — , wherein the nucleic acid encodes a full length 80090, 52874, 52880, 63497, or 
33425 protein or an active fragment thereof. 

At page 9, please replace paragraph(s) [0035]-[0037] with the following text: 
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Figure 4 depicts a BLAST alignment of human 80090 with a consensus amino acid 
sequence derived from a ProDomain No. PD313476, "CG4435" (Release 2001 .1; 
http^www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 40-21 7 of the 229 
amino acid consensus sequence (SEQ ID NO: 1 7), while the upper amino acid sequence corresponds to 
the "CG4435" domain of human 80090, amino acid residues 89-265 of SEQ ID NO:2. 

Figure 5 depicts a BLAST alignment of human 80090 with a consensus amino acid 
sequence derived from a ProDomain No. PD002778, "Transferase fucosyltransferase glycosyltransferase 
alpha- 12-fucosyltransferase galactoside transmembrance glycoprotein 3-L- signal-anchor golgi" (Release 
2001 . 1 ; http7//www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 62-200 of 
the 227 amino acid consensus sequence (SEQ ID NO: 1 8), while the upper amino acid sequence 
corresponds to the "Transferase fucosyltransferase glycosyltransferase alpha- 12-fucosyltransferase 
galactoside transmembrance glycoprotein 3-L- signal-anchor golgi" domain of human 80090, amino acid 
residues 221-367 of SEQ IDNO:2. 

Figure 6 depicts a BLAST alignment of human 80090 with a consensus amino acid sequence 
derived from a ProDomain No. PD323544, "CG9169" (Release 2001 .1 ; 

httpr//www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 129-424 of the 445 
amino acid consensus sequence (SEQ ID NO: 19), while the upper amino acid sequence corresponds to 
the "CG9169" domain of human 80090, amino acid residues 90-390 of SEQ ID NO:2. 

At pages 10-1 1, please replace paragraph(s) [0041]-[0043] with the following text: 

Figure 10 depicts a BLAST alignment of human 52874 with a consensus amino acid sequence 
derived from a ProDomain No. PD032606, "Receptor neurotensin coupled G-protein type transmembrane 
lipoprotein levocabastine- palmitate phosphorylation" (Release 2001.1; 

http^www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 12-76 of the 84 
amino acid consensus sequence (SEQ ID NO:22), while the upper amino acid sequence corresponds to 
the "Receptor neurotensin coupled G-protein type transmembrane lipoprotein levocabastine- palmitate 
phosphorylation" domain of human 52874, amino acid residues 253-320 of SEQ ID NO:5. 

Figure 11 depicts a BLAST alignment of human 52874 with a consensus amino acid sequence 
derived from a ProDomain No. PD 128 109, "Similar somatostatin receptors" (Release 2001 .1 ; 
http7#www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 160-267 of the 305 
amino acid consensus sequence (SEQ ID NO:23), while the upper amino acid sequence corresponds to 
the "Similar somatostatin receptors" domain of human 52874, amino acid residues 208-316 of SEQ ID 
NO:5. 



(Page 3 of 22) 



Practitioner's Docket No. MPI00-437P1RM 



Figure 12 depicts a BLAST alignment of human 52874 with a consensus amino acid sequence 
derived from a ProDomain No. PD145471, "C01G12.7" (Release 2001.1; 

httpr//www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 1 15-298 of the 599 
amino acid consensus sequence (SEQ ID NO:24), while the upper amino acid sequence corresponds to 
the "C01G12.7" domain of human 52874, amino acid residues 16-192 of SEQ ID NO:5. 

At pages 11-12, please replace paragraph(s) [0047]-[0050] with the following text: 

Figure 16 depicts a BLAST alignment of human 52880 with a consensus amino acid sequence 
derived from a ProDomain No. PD3 1 0793, "Receptor orphan GPR26 protein-coupled" (Release 200 1 . 1 ; 
httpr//www .toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 2-183 of the 205 
amino acid consensus sequence (SEQ ID NO:26), while the upper amino acid sequence corresponds to 
the "Receptor orphan GPR26 protein-coupled" domain of human 52880, amino acid residues 134-315 of 
SEQIDNO:8. 

Figure 77 depicts a BLAST alignment of human 52880 with a consensus amino acid sequence 
derived from a ProDomain No. PD155019, "Receptor type hypocretin EG:22E5.10 EG:22E5.1 1 
transmembrane coupled orexin G-protein" (Release 2001 .1 ; http7#www.toulouse.inra.fr/prodom.html). 
The lower sequence is amino acid residues 28-1 84 of the 227 amino acid consensus sequence (SEQ ID 
NO:27), while the upper amino acid sequence corresponds to the "Receptor type hypocretin EG:22E5.10 
EG:22E5.1 1 transmembrane coupled orexin G-protein" domain of human 52880, amino acid residues 
175-321 ofSEQIDNO:8. 

Figure 18 depicts a BLAST alignment of human 52880 with a consensus amino acid sequence 
derived from a ProDomain No. PD032094, "Receptor acid lysophosphatidic high-affinity homolog 
transmembrane novel rhodopsin similar" (Release 2001.1; http^www.toulouse.inra.fr/prodom.html). 
The lower sequence is amino acid residues 24-1 83 of the 202 amino acid consensus sequence (SEQ ID 
NO:28), while the upper amino acid sequence corresponds to the "Receptor acid lysophosphatidic high- 
affinity homolog transmembrane novel rhodopsin similar" domain of human 52880, amino acid residues 
171-322 of SEQ ID NO:8. 

Figure 19 depicts a BLAST alignment of human 52880 with a consensus amino acid sequence 
derived from a ProDomain No. PD322057, "NT2RM2000452 FLJ10317 Fis cDNA" (Release 2001.1; 
http7//www.toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 7-99 of the 388 
amino acid consensus sequence (SEQ ID NO:29), while the upper amino acid sequence corresponds to 
the "NT2RM2000452 FLJ10317 Fis cDNA" domain of human 52880, amino acid residues 221-329 of 
SEQ IDNO:8. 
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At page 13, please replace paragraph(s) [0054] with the following text: 

Figure 23 depicts a BLAST alignment of human 63497 with a consensus amino acid sequence 
derived from a ProDomain No. PD009900, "Receptor pheromone G-protein vomeronasal coupled M24 
VN1 VN3 VN2 VN4" (Release 2001.1; htt^/www.toulouse.inra.fr/prodom.html). The lower sequence 
is amino acid residues 5-264 of the 274 amino acid consensus sequence (SEQ ID NO:3 1), while the upper 
amino acid sequence corresponds to the "Receptor pheromone G-protein vomeronasal coupled M24 VN1 
VN3 VN2 VN4" domain of human 63497, amino acid residues 36-295 of SEQ ID NO: 11. 

At pages 14-15, please replace paragraph(s) [0058]-[0060] with the following text: 

Figure 27 depicts a BLAST alignment of human 33425 with a consensus amino acid sequence 
derived from a ProDomain No. PD301916, "Similar NT2RM2000363 cluster Fis FLJ10312 weakly 
cDNA breakpoint" (Release 2001 .1; http^www.toulouse.inra.fr/prodom.html). The lower sequence is 
amino acid residues 1-103 of the 153 amino acid consensus sequence (SEQ IDNO:33), while the upper 
amino acid sequence corresponds to the "Similar NT2RM2000363 cluster Fis FLJ 1 03 1 2 weakly cDN A 
breakpoint" domain of human 33425, amino acid residues 5 1 6-608 of SEQ ID NO: 14. 

Figure 28 depicts a BLAST alignment of human 33425 with a consensus amino acid sequence 
derived from a ProDomain No. PD000780, "GTPase activating similar GTPase-activating activation 
domain Fis zinc cDNA subunit" (Release 2001.1; httpr//www.toulouse.inra.fr/prodom.html). The lower 
sequence is amino acid residues 16-118 of the 161 amino acid consensus sequence (SEQ ID NO:34), 
while the upper amino acid sequence corresponds to the "GTPase activating similar GTPase-activating 
activation domain Fis zinc cDNA subunit" domain of human 33425, amino acid residues 390-486 of SEQ 
IDNO:14. 

Figure 29 depicts a BLAST alignment of human 33425 with a consensus amino acid sequence 
derived from a ProDomain No. PD215173, "RLIP" (Release 2001.1; 

httpr//www .toulouse.inra.fr/prodom.html). The lower sequence is amino acid residues 96-185 of the 186 
amino acid consensus sequence (SEQ ID NO:35), while the upper amino acid sequence corresponds to 
the "RLIP" domain of human 33425, amino acid residues 399-488 of SEQ ID NO: 14. 

At page 1 6, please replace paragraph(s) [0064]-[0065] with the following text: 

For general information regarding PFAM identifiers, PS prefix, and PF prefix domain 
identification numbers, refer to Sonnhammer et al. (1997) Protein 28:405-420 and 
httpr/Avww.psc.edu/general/ sofhvare/packages/pfam/pfam.html. 



(Page 5 of 22) 



Practitioner's Docket No. MPI00-437P1RM 



A plasmid oontaining tho nuolootido soquonoo encoding human 8 0090 was d e posit e d with 
American Typo Culture Collection (ATCC), 10801 University Boulevard, Manassas, V A 201 10 2209, on 

and aligned Accession NumW Th ' r dftpnr . it will hn mnintnin e d und e r th e terms of th e 

Budapest Treaty on th e International Recognition of tho Deposit of Microorganisms for tho Purpos e s of 
Patent Procedure. This deposit was mad e merely as a conveni e nce for thos e of skill in tho art and is not 
an admission that a d e posit is r e quired und e r 35 U.S.C. § 1 12. 

At pages 17-18, please replace paragraph(s) [007 1 ]-[0072] with the following text: 

To identify the presence of a "fucosyltransferase" domain in an 80090 protein sequence, and 
make the determination that a polypeptide or protein of interest has a particular profile, the amino acid 
sequence of the protein can be searched against a database of HMMs (e.g., the Pfam database, release 2.1) 
using the default parameters (http^www.sanger.ac.uk/Software/Pfam/HMM_search). For example, the 
hmmsf program, which is available as part of the HMMER package of search programs, is a family 
specific default program for MILPAT0063, and a score of 1 5 is the default threshold score for 
determining a hit. Alternatively, the threshold score for determining a hit can be lowered (e.g., to 8 bits). 
A description of the Pfam database can be found in Sonhammer et al. (1997) Proteins 28(3):405-420, and 
a detailed description of HMMs can be found, for example, in Gribskov et al.(1990) Meth. Enzymol. 
183:146-1 59; Gribskov et al.(1987) Proc. Natl. Acad. Sci. USA 84:4355-4358; Krogh et al.(1994) J. 
Mol. Biol. 235:1501-1531; and Stultz et al.(1993) Protein Sci. 2:305-314, the contents of which are 
incorporated herein by reference. A search was performed against the HMM database resulting in the 
identification of a fucosyltransferase domain in the amino acid sequence of human 80090 at about 
residues 35-395 of SEQ ID NO:2 (see Figure 1). 

An 80090 polypeptide can include a fucosyltransferase domain or regions homologous with a 
fucosyltransferase domain. As used herein, the fucosyltransferase domain includes an amino acid 
sequence of about 200-500 amino acid residues in length. Preferably, an fucosyltransferase protein 
domain includes at least about 250-450 amino acids, more preferably about 300-400 amino acids, or 
about 350-375 amino acids. The fucosyltransferase domain (HMM) has been assigned the PFAM 
Accession PF00852 (http^pfam.wustl.edu/). An alignment of the fucosyltransferase domain (amino 
acids 35-395 of SEQ ID NO:2) of human 80090 with a consensus amino acid sequence derived from a 
hidden Markov model is depicted in Figure 3. 

At pages 18-19, please replace paragraph(s) [0075]-[0078] with the following text: 
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A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD313476("CG4435" SEQ ID NO: 17, ProDomain Release 
200 1.1; htt^www.toulouse.inra.fr/prodom.html). An alignment of the "CG4435" domain (amino acids 

89- 265 of SEQ ID NO:2) of human 80090 with a consensus amino acid sequence (SEQ ID NO: 17) 
derived from a hidden Markov model is depicted in Figure 4. The consensus sequence for SEQ ID 
NO: 1 7 is 42% identical over amino acids 89-265 of SEQ ID NO:2 as shown in Figure 4. 

A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD002778("Transferase fucosyltransferase glycosyltransferase 
alpha- 12-fucosyltransferase galactoside transmembrance glycoprotein 3-L- signal-anchor golgi" SEQ ID 
NO: 1 8, ProDomain Release 2001 . 1 ; http^www.toulouse.inra.fr/prodom.html). An alignment of the 
"Transferase fucosyltransferase glycosyltransferase alpha- 12-fucosyltransferase galactoside 
transmembrance glycoprotein 3-L- signal-anchor golgi" domain (amino acids 221-367 of SEQ ID NO:2) 
of human 80090 with a consensus amino acid sequence (SEQ ID NO: 1 8) derived from a hidden Markov 
model is depicted in Figure 5. The consensus sequence for SEQ ID NO: 1 8 is 36% identical over amino 
acids 221-367 of SEQ IDNO:2 as shown in Figure 5. 

A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD323544("CG9169" SEQ ID NO: 19, ProDomain Release 
2001 .1 ; httpr//www.toulouse.inra.fr/prodom.html). An alignment of the "CG91 69" domain (amino acids 

90- 390 of SEQ ID NO:2) of human 80090 with a consensus amino acid sequence (SEQ ID NO: 19) 
derived from a hidden Markov model is depicted in Figure 6. The consensus sequence for SEQ ID 
NO: 19 is 24% identical over amino acids 90-390 of SEQ ID NO:2 as shown in Figure 6. 

A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD323544("CG9169" SEQ ID NO: 19, ProDomain Release 
200 1 . 1 ; httpr//www.toulouse.inra.fr/prodom.html). An alignment of the "CG9 1 69" domain (amino acids 
90-390 of SEQ ID NO:2) of human 80090 with a consensus amino acid sequence (SEQ ID NO: 19) 
derived from a hidden Markov model is depicted in Figure 6. The consensus sequence for SEQ ID 
NO: 19 is 24% identical over amino acids 90-390 of SEQ ID NO:2 as shown in Figure 6. 

At page 23, please replace paragraph(s) [0092]-[0093] with the following text: 

For general information regarding PFAM identifiers, PS prefix, and PF prefix domain 
identification numbers, refer to Sonnhammer et al. (1997) Protein 28:405-420 and 
http^www.psc.edu/general/ software/packages/pfam/pfam.html. 

A plasmid containing tho nucleotide soqu e no e onooding human 52 8 74 wao deposited with 
American Typo Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 201 10 2209, on 
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and Q33ign e d Accession Number . This deposit will be maintained under the terms of th e 

Budapest Tr e aty on the International Recognition of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This d e posit was made m e r e ly as a conv e nience for thos e of skill in the art and is not 
an admission that a deposit is required under 35 U.S.C. §112. 

At pages 24-25, please replace paragraph(s) [0098]-[0099] with the following text: 

For general information regarding PFAM identifiers, PS prefix, and PF prefix domain 
identification numbers, refer to Sonnhammer et al. (1997) Protein 28:405-420 and 
fettp^/www.psc.edu/general/ software/packages/pfam/pfam.html. 

A plasmid containing the nucleotide sequence encoding human 52880 was deposited with 
American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 201 10 2209, on 

and assigned Acc e3 sion Number . This deposit will b e maintained under the terms of the 

Budap e st Tr e aty on the International Recognition of th e D e posit of Microorganisms for the Purposes of 
Patent Procedur e . This d e posit was made merely as a conveni e nc e for thos e of skill in the art and is not 
an admission that a deposit is required und e r 35 U.S.C. §112. 

At pages 26-27, please replace paragraph(s) [0104]-[0105] with the following text: 

For general information regarding PFAM identifiers, PS prefix, and PF prefix domain 
identification numbers, refer to Sonnhammer et al. (1997) Protein 28:405-420 and 
httpf//www.psc.edu/general/ software/packages/pfam/pfam.html. 

A plasmid containing th e nucl e otide sequence encoding human 63 4 97 was deposited with 
American Type Culture Collection (ATCC), 10801 Univ e rsity Boulevard, Manassas, VA 201 10 2209, on 

and assigned Accession Number . This deposit will be maintain e d under the terms of the 

Budap e st Tr e aty on th e Int e rnational Recognition of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This deposit was made m e r e ly as a conveni e nc e for those of skill in the art and is not 
an admission that a deposit is requir e d under 35 U.S.C. §112. 

At pages 27-28, please replace paragraph(s) [0108] with the following text: 

As used herein, the term "G protein-coupled receptor" or "GPCR" refers to a family of proteins 
that preferably comprise an N-terminal extracellular domain, seven transmembrane domains (also referred 
to as membrane-spanning domains), three extracellular domains (also referred to as extracellular loops), 
three cytoplasmic domains (also referred to as cytoplasmic loops), and a C-terminal cytoplasmic domain 
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(also referred to as a cytoplasmic tail). Members of the GPCR family also share certain conserved amino 
acid residues, some of which have been determined to be critical to receptor function and/or G protein 
signaling. For example, GPCRs usually contain the following features including a conserved asparagine 
residue in the first transmembrane domain. An alignment of the transmembrane domains of 44 
representative GPCRs can be found at http^mgdkkl. nidll.nih.gov: 8000/extended.html. 

At pages 31-32, please replace paragraph(s) [01 17]-[01 18] with the following text: 

To identify the presence of a 7 transmembrane receptor profile in a 52874, 52880, or 63497 
receptor, and make the determination that a polypeptide or protein of interest has a particular profile, the 
amino acid sequence of the protein can be searched against the Pfam database of HMMs (e.g., the Pfam 
database, release 2.1) using the default parameters 

(http^www.sanger.ac.ixk/Software/Pfani/HMM search). For example, the hmmsf program, which is 
available as part of the HMMER package of search programs, is a family specific default program for 
PF00001 and a score of 15 is the default threshold score for determining a hit. Alternatively, the 
threshold score for determining a hit can be lowered (e.g., to 8 bits). A description of the Pfam database 
can be found in Sonhammer et al (1997) Proteins 28:405-420 and a detailed description of HMMs can be 
found, for example, in Gribskov et al. (1990) Meth. Enzymol. 183: 146-1 59; Gribskov et al (1987) Proc. 
Natl. Acad. Sci. USA 84:4355-4358; Krogh etal. (1994)7. Mol Biol 235:1501-1531; and Stultz et al 
(1993) Protein Sci. 2:305-314, the contents of which are incorporated herein by reference. Alternatively, 
the seven transmembrane domain can be predicted based on stretches of hydrophobic amino acids 
forming a-helices (SOUSI server). For example, using a SOUSI server, two 7 TM receptor domain 
profiles were identified in the amino acid sequence of SEQ ID NO:5 (e.g., amino acids 40-120 and 142- 
293 of SEQ ID NO:5), one 7TM receptor domain profile in the amino acid sequence of SEQ ID NO: 8 
(e.g., amino acids 22-294 of SEQ ID NO:8), and one 7TM receptor domain profile in the amino acid 
sequence of SEQ ID NO:l 1 (e.g., amino acids 189-243 of SEQ ID NO:l 1). Accordingly, 52874, 52880, 
or 63497 proteins having at least 50-60% homology, preferably about 60-70%, more preferably about 70- 
80%, or about 80-90% homology with the 7 transmembrane receptor profile of human 52874, 52880, or 
63497 are within the scope of the invention. 

In one embodiment, a 52874, 52880, or 63497 protein includes at least one "7 transmembrane 
receptor" domain or regions homologous with a "7 transmembrane receptor" domain. As used herein, the 
term "7 transmembrane receptor" domain includes an amino acid sequence having at least about 10-350 
amino acid residues in length and having a bit score for the alignment of the sequence to the 7tm_l family 
Hidden Markov Model (HMM) of at least 8. Preferably, a "7 transmembrane receptor family" domain 
includes at least about 50-350 amino acid residues, more preferably about 75-300 amino acid residues, or 
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at least about 80-280 amino acids in length and having a bit score for the alignment of the sequence to the 
"7 transmembrane receptor family" domain (HMM) of at least 12 or greater. The "7 transmembrane 
receptor family" domain (HMM) has been assigned the PFAM Accession PF00001 
(h^r//pfam.wustl.edu/cgi-bin/getdesc?name=7tm_l). An alignment of the "7 transmembrane receptor 
family" domain (amino acids 40-120 and 142-293 of SEQ ID NO:5, 22-294 of SEQ ID NO:8 and 189- 
243 of SEQ ID NO: 1 1) of human 52874, 52880, or 63497 with a consensus amno acid sequence derived 
from a hidden Markov model is depicted in Figures 9A-B, 15, and 22. 

At pages 33-36, please replace paragraph(s) [0122]-[0129] with the following text: 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD032606("Receptor neurotensin coupled G-protein type 
transmembrane lipoprotein levocabastine- palmitate phosphorylation" SEQ ID NO:22, ProDomain 
Release 2001 .1 ; httpr//www.toulouse.inra.fr/prodom.html). An alignment of the "Receptor neurotensin 
coupled G-protein type transmembrane lipoprotein levocabastine- palmitate phosphorylation" domain 
(amino acids 253-320 of SEQ ID NO:5) of human 52874 with consensus amino acid sequences (SEQ ID 
NO:22) derived from a hidden Markov model is depicted in Figure 10. The consensus sequence for SEQ 
ID NO:22 is 33% identical over amino acids 253-320 of SEQ ID NO:5 as shown in Figure 10. 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD128109("Similar somatostatin receptors" SEQ ID NO:23, 
ProDomain Release 2001 .1 ; httpr//www .toulouse.inra.fr/prodom.html). An alignment of the "Similar 
somatostatin receptors" domain (amino acids 208-316 of SEQ ID NO:5) of human 52874 with consensus 
amino acid sequences (SEQ ID NO:23) derived from a hidden Markov model is depicted in Figure 1 1 . 
The consensus sequence for SEQ ID NO:23 is 24% identical over amino acids 208-316 of SEQ ID NO:5 
as shown in Figure 1 1 . 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD145471("C01G12.7" SEQ ID NO:24, ProDomain Release 
2001.1; httpr//www.toulouse.inra.fr/prodom.html). An alignment of the "C01G12.7" domain (amino 
acids 16-192 of SEQ ID NO:5) of human 52874 with consensus amino acid sequences (SEQ ID NO:24) 
derived from a hidden Markov model is depicted in Figure 12. The consensus sequence for SEQ ID 
NO:24 is 20% identical over amino acids 16-192 of SEQ ID NO:5 as shown in Figure 12. 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD310793("Receptor orphan GPR26 protein-coupled" SEQ ID 
NO:26, ProDomain Release 2001 . 1 ; http://www.toulouse.inra.fr/prodom.html). An alignment of the 
"Receptor orphan GPR26 protein-coupled" domain (amino acids 1 34-3 1 5 of SEQ ID NO:8) of human 
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52880 with consensus amino acid sequences (SEQ ID NO:26) derived from a hidden Markov model is 
depicted in Figure 16. The consensus sequence for SEQ ID NO:26 is 51% identical over amino acids 
134-315 of SEQ ID NO:8 as shown in Figure 16. 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD155019("Receptor type hypocretin EG:22E5.10 EG:22E5.1 1 
transmembrane coupled orexin G-protein" SEQ ID NO:27, ProDomain Release 2001 .1 ; 
httpr//www.toulouse.inra.fr/prodom.html). An alignment of the "Receptor type hypocretin EG:22E5.10 
EG:22E5.1 1 transmembrane coupled orexin G-protein" domain (amino acids 175-321 of SEQ ID NO: 8) 
of human 52880 with consensus amino acid sequences (SEQ ID NO:27) derived from a hidden Markov 
model is depicted in Figure 17. The consensus sequence for SEQ ID NO:27 is 23% identical over amino 
acids 175-321 of SEQ ID NO: 8 as shown in Figure 17. 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD032094("Receptor acid lysophosphatidic high-affinity 
homolog transmembrane novel thodopsin similar" SEQ ID NO:28, ProDomain Release 2001.1; 
http^www.toulouse.inra.fr/prodom.html). An alignment of the "Receptor acid lysophosphatidic high- 
affinity homolog transmembrane novel thodopsin similar" domain (amino acids 171-322 of SEQ ID 
NO: 8) of human 52880 with consensus amino acid sequences (SEQ ID NO:28) derived from a hidden 
Markov model is depicted in Figure 18. The consensus sequence for SEQ ID NO:28 is 22% identical 
over amino acids 171-322 of SEQ ID NO: 8 as shown in Figure 18. 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD322057("NT2RM2000452 FLJ10317 Fis cDNA" SEQ ID 
NO:29, ProDomain Release 2001 . 1 ; http7//www.toulouse.inra.fr/prodom.html). An alignment of the 
"NT2RM2000452 FLJ10317 Fis cDNA" domain (amino acids 221-329 of SEQ ID NO:8) of human 
52880 with consensus amino acid sequences (SEQ ID NO:29) derived from a hidden Markov model is 
depicted in Figure 19. The consensus sequence for SEQ ID NO:29 is 30% identical over amino acids 
221-329 of SEQ ID NO: 8 as shown in Figure 19. 

A BLAST search was performed against the HMM database resulting in the identification of 
regions homologous to ProDom family PD009900("Receptor pheromone G-protein vomeronasal coupled 
M24 VN1 VN3 VN2 VN4" SEQ ID NO:31, ProDomain Release 2001.1; 
httpi//www.toulouse.inra.fr/prodom.html). An alignment of the "Receptor pheromone G-protein 
vomeronasal coupled M24 VN1 VN3 VN2 VN4" domain (amino acids 36-295 of SEQ ID NO:l 1) of 
human 63497 with consensus amino acid sequences (SEQ ID NO:31) derived from a hidden Markov 
model is depicted in Figure 23. The consensus sequence for SEQ ID NO:31 is 34% identical over amino 
acids 36-295 of SEQ ID NO: 1 1 as shown in Figure 23. 
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At page 42, please replace paragraph(s) [0149]-[0150] with the following text: 

For general information regarding PFAM identifiers, PS prefix, and PF prefix domain 
identification numbers, refer to Sonnhammer et al. (1997) Protein 28:405-420 and 
httpT/Avww.psc.edu/general/ software/packages/pfam/pfam.html. 

A plasmid containing the nucleotide sequence encoding human 33 4 25 was deposited with 
American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 201 10 2209, on 

and assigned Acc e ssion Numb e r . This deposit will be maintained under the terms of the 

Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpos e s of 
Patent Procedure. This deposit w r as made merely as a convenience for those of skill in th e art and is not 
an admi s sion that a deposit is required under 35 U.S.C. § 1 12. 

At pages 43-44, please replace paragraph(s) [0157] with the following text: 

As used herein, the term "rhoGAP domain" includes an amino acid sequence of about 50-300 
amino acid residues in length and having a bit score for the alignment of the sequence to the rhoGAP 
domain (HMM) of at least 8. Preferably, a rhoGAP domain includes at least about 100-250 amino acids, 
more preferably about 100-200 amino acid residues, or about 140-160 amino acids and has a bit score for 
the alignment of the sequence to the rhoGAP domain (HMM) of at least 16, 25, 50, 100 or greater. The 
rhoGAP domain (HMM) has been assigned the PFAM Accession PF00620 (httpr//pfam.wustl.edu/). An 
alignment of the rhoGAP domain (amino acids 343-494 of SEQ ID NO: 14) of human 33425 with a 
consensus amino acid sequence derived from a hidden Markov model is depicted in Figure 26. 

At pages 44-45, please replace paragraph(s) [0159]-[0160] with the following text: 

To identify the presence of an "rhoGAP" domain in a 33425 protein sequence, and make the 
determination that a polypeptide or protein of interest has a particular profile, the amino acid sequence of 
the protein can be searched against a database of HMMs (e.g., the Pfam database, release 2.1) using the 
default parameters (httpr/Avww. sanger.ac.uk/Software/Pfam/HMM_search). For example, the hmmsf 
program, which is available as part of the HMMER package of search programs, is a family specific 
default program for MILPAT0063 and a score of 15 is the default threshold score for determining a hit. 
Alternatively, the threshold score for determining a hit can be lowered (e.g., to 8 bits). A description of 
the Pfam database can be found in Sonhammer et al., (1997) Proteins 28(3):405-420 and a detailed 
description of HMMs can be found, for example, in Gribskov et al., (1990) Meth. Enzymol 183:146-159; 
Gribskov et al., (1987) Proc. Natl Acad. Sci. USA 84:4355-4358; Krogh et al, (1994) J. Mol Biol 
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235:1501-1531; and Stultz et al., (1993) Protein Sci. 2:305-314, the contents of which are incorporated 
herein by reference. 

An additional method to identify the presence of a "RhoGAP" domain in a 33425 protein 
sequence, and make the determination that a polypeptide or protein of interest has a particular profile, the 
amino acid sequence of the protein can be searched against a SMART database (Simple Modular 
Architecture Research Tool, http^smart.embl-heidelberg.de/) of HMMs as described in Schultz et al 
(1 998), Proc. Natl Acad, Sci. USA 95:5857 and Schultz et al (2000) Nucl Acids Res 28:23 1 . The 
database contains domains identified by profiling with the hidden Markov models of the HMMer2 search 
program (R. Durbin et al (1998) Biological sequence analysis: probabilistic models of proteins and 
nucleic acids. Cambridge University Press.; httpr/Zhmmer. wustl.edu/). The database also is extensively 
annotated and monitored by experts to enhance accuracy. A search was performed against the HMM 
database resulting in the identification of a "RhoGAP_3" domain in the amino acid sequence of human 
33425 at about residues 340 to 520 of SEQ ID NO: 14 (see Figure 24A-C). 

At pages 45-46, please replace paragraph(s) [0162]-[0164] with the following text: 

A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD301916("Similar NT2RM2000363 cluster Fis FLJ10312 weakly 
cDNA breakpoint" SEQ ID NO:33, ProDomain Release 2001.1; 

httpr//www.toulouse.inra.fr/prodom.html). An alignment of the "Similar NT2RM2000363 cluster Fis 
FLJ10312 weakly cDNA breakpoint" domain (amino acids 516-608 of SEQ ID NO:14) of human 33425 
with a consensus amino acid sequence (SEQ ID NO:33) derived from a hidden Markov model is depicted 
in Figure 27. The consensus sequence for SEQ ID NO:33 is 39% identical over amino acids 516-608 of 
SEQ ID NO: 14 as shown in Figure 27. 

A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD000780("GTPase activating similar GTPase-activating 
activation domain Fis zinc cDNA subunit" SEQ ID NO:34, ProDomain Release 2001.1; 
ht^r//www.toulouse.inra.fr/prodom.html). An alignment of the "GTPase activating similar GTPase- 
activating activation domain Fis zinc cDNA subunit" domain (amino acids 390-486 of SEQ ID NO: 14) of 
human 33425 with a consensus amino acid sequence (SEQ ID NO:34) derived from a hidden Markov 
model is depicted in Figure 28. The consensus sequence for SEQ ID NO:34 is 36% identical over amino 
acids 390-486 of SEQ ID NO: 14 as shown in Figure 28. 

A BLAST search was performed against the HMM database resulting in the identification of a 
region homologous to ProDom family PD215173("RLIP" SEQ ID NO:35, ProDomain Release 2001.1; 
http7//www.toulouse.inra.fr/prodom.html). An alignment of the "RLIP" domain (amino acids 399-488 of 



(Page 13 of 22) 



Practitioner's Docket No. MPI00-437P1RM 



SEQ ID NO: 14) of human 33425 with a consensus amino acid sequence (SEQ ID NO:35) derived from a 
hidden Markov model is depicted in Figure 29. The consensus sequence for SEQ ID NO:35 is 34% 
identical over amino acids 399-488 of SEQ ID NO: 14 as shown in Figure 29. 

At page 59, please replace paragraph(s) [0263]-[0264] with the following text: 

A "non-essential" amino acid residue is a residue that can be altered from the wild-type sequence 
of 80090, 52874, 52880, 63497, or 33425(e.g., the sequence of SEQ ID NO:1,3,4,6,7,9,10,12,13 or 15^ 
th e nucl e otide sequence of the DNA insert of the plasmid d e posit e d with ATCC Q3 Accession Numb e r 

) without abolishing or more preferably, without substantially altering a biological activity, whereas 
an "essential" amino acid residue results in such a change. For example, amino acid residues that are 
conserved among the polypeptides of the present invention, e.g., those present in the fucosyltransferase, 
7TM receptor, or RhoGAP domains, are predicted to be particularly unamenable to alteration. 

A "conservative amino acid substitution" is one in which the amino acid residue is replaced with 
an amino acid residue having a similar side chain. Families of amino acid residues having similar side 
chains have been defined in the art. These families include amino acids with basic side chains (e.g., 
lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side 
chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched 
side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted nonessential amino acid residue in an 80090, 52874, 52880, 
63497, or 33425 protein is preferably replaced with another amino acid residue from the same side chain 
family. Alternatively, in another embodiment, mutations can be introduced randomly along all or part of 
an 80090, 52874, 52880, 63497, or 33425 coding sequence, such as by saturation mutagenesis, and the 
resultant mutants can be screened for 80090, 52874, 52880, 63497, or 33425 biological activity to 
identify mutants that retain activity. Following mutagenesis of SEQ ED NO:1,3,4 5 6,7,9,10, 12,13 or 15^6? 
th e nucl e otide sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 

, the encoded protein can be expressed recombinantly and the activity of the protein can be 
determined. 

At pages 61-62, please replace paragraph(s) [0268] with the following text: 

The comparison of sequences and determination of percent identity between two sequences can 
be accomplished using a mathematical algorithm. In a preferred embodiment, the percent identity 
between two amino acid sequences is determined using the Needleman and Wunsch (J. Mol Biol 
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(48):444-453 (1970)) algorithm which has been incorporated into the GAP program in the GCG software 
package (available at httpr//www. gcg.com), using either a Blossum 62 matrix or a PAM250 matrix, and a 
gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred 
embodiment, the percent identity between two nucleotide sequences is determined using the GAP 
program in the GCG software package (available at-httpr//www. gcg.com), using a NWSgapdna.CMP 
matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A particularly 
preferred set of parameters (and the one that should be used if the practitioner is uncertain about what 
parameters should be applied to determine if a molecule is within a sequence identity or homology 
limitation of the invention) is using a Blossum 62 scoring matrix with a gap open penalty of 12, a gap 
extend penalty of 4, and a frameshift gap penalty of 5. 

At page 62, please replace paragraph(s) [0270] with the following text: 

The nucleic acid and protein sequences described herein can be used as a "query sequence" to 
perform a search against public databases to, for example, identify other family members or related 
sequences. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of 
Altschul, et ah, (1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to 80090, 
52874, 52880, 63497, or 33425 nucleic acid molecules of the invention. BLAST protein searches can be 
performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences 
homologous to 80090, 52874, 52880, 63497, or 33425 protein molecules of the invention. To obtain 
gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et 
al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, 
the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http7//www.ncbi.nlm.nih.gov. 

At pages 64-66, please replace paragraph(s) [0278]-[0281] with the following text: 

In one embodiment, an isolated nucleic acid molecule of the invention includes the nucleotide 
sequence shown in SEQ ID NO: 1 , SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO: 1 0, OR SEQ ID NO: 1 3rOf 
the nucleotide sequence of tho DNA insert of the plasmid deposited with ATCC as Accession Number 

, or a portion of any of these nucleotide sequences. In one embodiment, the nucleic acid molecule 
includes sequences encoding the human 80090, 52874, 52880, 63497, or 33425 protein (i.e., "the coding 
region", from nucleotides 163-1623 of SEQ ID NO:l, 32-1417 of SEQ ID NO:4, 210-1301 of SEQ ID 
NO:7, 152-1057 of SEQ ID NO:10, and 73-2764 of SEQ ID NO:13, including the terminal codon), as 
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well as 5' untranslated sequences (nucleotides 1-162 of SEQ ID NO: 1, 1-31 of SEQ ID NO:4, 1-209 of 
SEQ ID NO:7, 1-151 of SEQ ID NO:10, and 1-72 of SEQ ID NO:13). Alternatively, the nucleic acid 
molecule can include only the coding region of SEQ ID NO: 1 , SEQ ID NO:4, SEQ ID NO:7, SEQ ID 
NO: 10, or SEQ ID NO: 13 (e.g., nucleotides 163-1623 of SEQ ID NO:l, 32-1417 of SEQ ID NO:4, 210- 
1301 of SEQ IDNO:7, 152-1057 of SEQ IDNO:10, and 73-2764 of SEQ IDNO:13, corresponding to 
SEQ ID NO:3, 6, 9, 12, and 15) and, e.g., no flanking sequences which normally accompany the subject 
sequence. In another embodiment, the nucleic acid molecule encodes a sequence corresponding to the 
mature protein of SEQ ID NO:2,5,8,l 1 or 14. 

In another embodiment, an isolated nucleic acid molecule of the invention includes a nucleic acid 
molecule which is a complement of the nucleotide sequence shown in SEQ ID NO: 1,3,4,6,7,9,10,12,13 or 
15 , or the nucleotide sequence of th e DNA ins e rt of th e plasmid deposited with ATCC a3 Accession 
Number or a portion of any of these nucleotide sequences. In other embodiments, the nucleic acid 

molecule of the invention is sufficiently complementary to the nucleotide sequence shown in SEQ ID 
NO:1,3,4,6,7,9,10,12,13 or 15 , or the nucleotide sequence of the DNA ins e rt of th e plasmid deposited 
with ATCC as Acc e ssion Number — such that it can hybridize to the nucleotide sequence shown in 
SEQ ID NO:1,3,4,6,7,9,10,12,13 or 15 , or the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number thereby forming a stable duplex. 

In one embodiment, an isolated nucleic acid molecule of the present invention includes a 
nucleotide sequence which is at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98%, 99%, or more homologous to the nucleotide sequence shown in SEQ ID 
NO:1,3,4,6,7,9,10,12,13 or 15 , or the nucleotide sequence of the DNA insert of the plasmid d e posit e d 
with ATCC as Accession Number In the case of an isolated nucleic acid molecule which is longer 

than or equivalent in length to the reference sequence, e.g., SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, or SEQ 
ID NO: 15, the comparison is made with the full length of the reference sequence. Where the isolated 
nucleic acid molecule is shorter than the reference sequence, e.g., shorter than SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO: 12, 
SEQ ID NO: 1 3, or SEQ ID NO: 1 5, the comparison is made to a segment of the reference sequence of the 
same length (excluding any loop required by the homology calculation). 

80090. 52874, 52880. 63497. or 33425 Nucleic Acid Fragments 

A nucleic acid molecule of the invention can include only a portion of the nucleic acid sequence 
of SEQ ID NO: 1 ,3,4,6,7,9, 1 0, 1 2, 1 3 or 1 5 , or the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number For example, such a nucleic acid molecule can 

include a fragment which can be used as a probe or primer or a fragment encoding a portion of an 80090, 
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52874, 52880, 63497, or 33425 protein, e.g., an immunogenic or biologically active portion of an 80090, 
52874, 52880, 63497, or 33425 protein. A fragment can comprise: nucleotides 265-1347 of SEQ ID 
NO:l, 149-391 ofSEQIDNO:4, 455-910 of SEQ ID NO:4, 273-1091 of SEQ ID NO:7, 716-880 of SEQ 
ID NO:10, or 1099-1554 of SEQ ID NO:13, which encodes an fucosyltransferase, 7TM receptor, or 
RhoGAP domain of human 80090, 52874, 52880, 63497, or 33425. The nucleotide sequence determined 
from the cloning of the 80090, 52874, 52880, 63497, or 33425 gene allows for the generation of probes 
and primers designed for use in identifying and/or cloning other 80090, 52874, 52880, 63497, or 33425 
family members, or fragments thereof, as well as 80090, 52874, 52880, 63497, or 33425 homologues, or 
fragments thereof, from other species. 

At page 66, please replace paragraph(s) [0284] with the following text: 

80090, 52874, 52880, 63497, or 33425 probes and primers are provided. Typically a 
probe/primer is an isolated or purified oligonucleotide. The oligonucleotide typically includes a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 7, 12 or 15, preferably 
about 20 or 25, more preferably about 30, 35, 40, 45, 50, 55, 60, 65, or 75 consecutive nucleotides of a 
sense or antisense sequence of SEQ ID NO:1,3,4,6,7,9,10,12,13 or 1 5 , or the nucleotide sequence of the 
DNA insert of the plasmid deposited with ATCC a3 Accession Number or of a naturally occurring 

allelic variant or mutant of SEQ ID NO: 1,3, 4,6,7,9, 10, 12, 13 or 1 5 , or th e nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Acce33ion Number 

At pages 67-68, please replace paragraph(s) [0289]-[0291] with the following text: 

A nucleic acid fragment encoding a "biologically active portion of an 80090, 52874, 52880, 
63497, or 33425 polypeptide" can be prepared by isolating a portion of the nucleotide sequence of SEQ 
ED NO: 1,3,4,6,7,9,10,12,13 or 15 , or the nucleotide sequence of the DNA insert of the plasmid deposited 
with ATCC as Acce33ion Number which encodes a polypeptide having an 80090, 52874, 52880, 

63497, or 33425 biological activity (e.g., the biological activities of the 80090, 52874, 52880, 63497, or 
33425 proteins as described herein), expressing the encoded portion of the 80090, 52874, 52880, 63497, 
or 33425 protein (e.g., by recombinant expression in vitro) and assessing the activity of the encoded 
portion of the 80090, 52874, 52880, 63497, or 33425 protein. For example, a nucleic acid fragment 
encoding a biologically active portion of 80090, 52874, 52880, 63497, or 33425 includes a 
fucosyltransferase, 7TM receptor, or RhoGAP domain (e.g., about amino acid residues 35-395 of SEQ ID 
NO:2, 40-120 and 142-293 of SEQ ID NO:5, 22-294 of SEQ ID NO:8, 189-243 of SEQ ID NO:l 1, or 
343-494 of SEQ ID NO: 14). A nucleic acid fragment encoding a biologically active portion of an 80090, 
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52874, 52880, 63497, or 33425 polypeptide, may comprise a nucleotide sequence which is greater than 
300-1200 or more nucleotides in length. 

In preferred embodiments, nucleic acids include a nucleotide sequence which is about 300, 400, 
500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400 nucleotides in length and hybridizes under 
stringent hybridization conditions to a nucleic acid molecule of SEQ ED NO:l, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:12, SEQ ID NO:13, or 
SEQ ID NO: 1 5 , or th e nucleotid e sequ e nce of th e DNA insert of th e pla3mid deposited with ATCC as 
Accession Number 



80090. 52874. 52880. 63497. or 33425 Nucleic Acid Variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
sequence shown in SEQ ID NO: 1,3, 4,6,7,9, 10, 12, 13 or 15 , or the nucleotide sequ e nce of the DNA insert 
of the plasmid deposited with ATCC a3 Accession Number Such differences can be due to 

degeneracy of the genetic code (and result in a nucleic acid which encodes the same 80090, 52874, 
52880, 63497, or 33425 proteins as those encoded by the nucleotide sequence disclosed herein. In 
another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide sequence 
encoding a protein having an amino acid sequence which differs, by at least 1, but less than 5, 10, 20, 50, 
or 100 amino acid residues that shown in SEQ ID NO: 2, 5, 8,1 1 or 14. If alignment is needed for this 
comparison the sequences should be aligned for maximum homology. "Looped" out sequences from 
deletions or insertions, or mismatches, are considered differences. 

At page 69, please replace paragraph(s) [0294] with the following text: 

In a preferred embodiment, the nucleic acid differs from that of SEQ ID NO: 1 ,3,4,6,7,9, 1 0, 1 2, 1 3 or 
15 , or the nucl e otid e s e qu e nc e of th e DNA ins e rt of the plasmid deposited with ATCC as Accession 
Number e.g., as follows: by at least one but less than 10, 20, 30, or 40 nucleotides; at least one but 

less than 1%, 5%, 10% or 20% of the in the subject nucleic acid. If necessary for this analysis the sequences 
should be aligned for maximum homology. "Looped" out sequences from deletions or insertions, or 
mismatches, are considered differences. 

At page 70, please replace paragraph(s) [0297] with the following text: 

Moreover, nucleic acid molecules encoding other 80090, 52874, 52880, 63497, or 33425 family 
members and, thus, which have a nucleotide sequence which differs from the 80090, 52874, 52880, 
63497, or 33425 sequences of SEQ ID NO:1,3,4,6,7,9,10,12,13 or 1 5 , or the nucleotide sequence of the 



(Page 18 of 22) 



• Practitioner's Docket No. MPI00-437P1RM 

DNA ins e rt of the plasmid deposited with ATCC as Acc e ssion Number are intended to be within 

the scope of the invention. 

At page 109, please replace paragraph(s) [0439] with the following text: 

The isolated mRNA can be used in hybridization or amplification assays that include, but are not 
limited to, Southern or Northern analyses, polymerase chain reaction analyses and probe arrays. One 
preferred diagnostic method for the detection of mRNA levels involves contacting the isolated mRNA 
with a nucleic acid molecule (probe) that can hybridize to the mRNA encoded by the gene being detected. 
The nucleic acid probe can be, for example, a full-length 80090, 52874, 52880, 63497, or 33425 nucleic 
acid, such as the nucleic acid of SEQ ID NO: 1 , or the DNA insert of the plasmid deposit e d with ATCC as 
Accession Number or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 

250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to 
80090, 52874, 52880, 63497, or 33425 mRNA or genomic DNA. Other suitable probes for use in the 
diagnostic assays are described herein. 
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